Weighted Geometric Grammars for Object Detection in Context
نویسندگان
چکیده
This thesis addresses the problem of detecting objects in images of complex scenes. Strong patterns exist in the types and spatial arrangements of objects that occur in scenes, and we seek to exploit these patterns to improve detection performance. We introduce a novel formalism—weighted geometric grammars (WGGs)—for flexibly representing and recognizing combinations of objects and their spatial relationships in scenes. We adapt the structured perceptron algorithm to parameter learning in WGG models, and develop a set of original clustering-based algorithms for structure learning. We then demonstrate empirically that WGG models, with parameters and structure learned automatically from data, can outperform a standard object detector. This thesis also contributes three new fully-labeled datasets, in two domains, to the scene understanding community. Thesis Supervisor: Leslie Pack Kaelbling Title: Professor Thesis Supervisor: Tomás Lozano-Pérez Title: Professor
منابع مشابه
GenRGenS: software for generating random genomic sequences and structures
SUMMARY GenRGenS is a software tool dedicated to randomly generating genomic sequences and structures. It handles several classes of models useful for sequence analysis, such as Markov chains, hidden Markov models, weighted context-free grammars, regular expressions and PROSITE expressions. GenRGenS is the only program that can handle weighted context-free grammars, thus allowing the user to mo...
متن کاملExpressing Context-Free Tree Languages by Regular Tree Grammars
In this thesis, three methods are investigated to express context-free tree languages by regular tree grammars. The first method is a characterization. We show restrictions to context-free tree grammars such that, for each restricted context-free tree grammar, a regular tree grammar can be constructed that induces the same tree language. The other two methods are approximations. An arbitrary co...
متن کاملLearning Grammatical Models for Object Recognition
Many object recognition systems are limited by their inability to share common parts or structure among related object classes. This capability is desirable because it allows information about parts and relationships in one object class to be generalized to other classes for which it is relevant. With this goal in mind, we have designed a representation and recognition framework that captures s...
متن کاملA robust aggregation operator for multi-criteria decision-making method with bipolar fuzzy soft environment
Molodtsov initiated soft set theory that provided a general mathematicalframework for handling with uncertainties in which we encounter the data by affix parameterized factor during the information analysis as differentiated to fuzzy as well as bipolar fuzzy set theory.The main object of this paper is to lay a foundation for providing a new application of bipolar fuzzy soft tool in ...
متن کاملStudying impressive parameters on the performance of Persian probabilistic context free grammar parser
In linguistics, a tree bank is a parsed text corpus that annotates syntactic or semantic sentence structure. The exploitation of tree bank data has been important ever since the first large-scale tree bank, The Penn Treebank, was published. However, although originating in computational linguistics, the value of tree bank is becoming more widely appreciated in linguistics research as a whole. F...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010